Method and system for displaying similar email messages based on message contents

ABSTRACT

A method and system for identifying changes to a data set, such as data within a mailbox, and performing actions based on the identified changes is discussed. In some examples, the system receives an indication of a change to a mailbox, creates a change journal entry for the change, and identifies data to be copied via the change journal entry. In some examples, the system leverages the change journal to associate messages with changes to a mailbox.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.15/360,036, filed on Nov. 23, 2016, entitled METHOD AND SYSTEM FORDISPLAYING SIMILAR EMAIL MESSAGES BASED ON MESSAGE CONTENTS, now U.S.Pat. No. 9,967,338, which is a continuation of U.S. patent applicationSer. No. 13/759,283, filed on Feb. 5, 2013, entitled METHOD AND SYSTEMFOR DISPLAYING SIMILAR EMAIL MESSAGES BASED ON MESSAGE CONTENTS, now,U.S. Pat. No. 9,509,652, which is a divisional of U.S. patentapplication Ser. No. 12/548,953, filed on Aug. 27, 2009, entitled METHODAND SYSTEM FOR LEVERAGING IDENTIFIED CHANGES TO A MAIL SERVER, now U.S.Pat. No. 8,370,442, which claims priority to U.S. Provisional PatentApplication No. 61/093,148, filed on Aug. 29, 2008, entitled METHOD ANDSYSTEM FOR LEVERAGING IDENTIFIED CHANGES TO A MAIL SERVER, each of whichare incorporated by reference in their entireties.

This application is related to U.S. patent application Ser. No.11/694,869, filed on Mar. 30, 2007 (now U.S. Pat. No. 7,882,077),entitled METHOD AND SYSTEM FOR OFFLINE INDEXING OF CONTENT ANDCLASSIFYING STORED DATA, and U.S. patent application Ser. No.11/564,119, filed on Nov. 28, 2006 (now U.S. Pat. No. 7,668,884),entitled SYSTEMS AND METHODS FOR CLASSIFYING AND TRANSFERRINGINFORMATION IN A STORAGE NETWORK, each of which is hereby incorporatedby reference.

BACKGROUND

Processes that typically copy, backup, or duplicate data, such asMicrosoft Exchange data (email messages, mail settings, and so on), areoften laborious, time-intensive processes. The typical backup processconnects to each user's mailbox and compares the entire contents (i.e.,every message) of the mailbox with a previous backup copy of thatmailbox. Often, the backup process will access every message in themailbox to determine if anything has changed since a previous copyprocess occurred. Then, the backup process can perform a copy or backupoperation, only after identifying the changes from the entire mailbox.

Additionally, typical email systems present changes to a user by onlyupdating the user's mailbox with the change (such as by displaying anewly received email message at the top of a list of emails). However,certain emails may be related or similar to other previous messages, andalthough email systems can sort emails via simple header information (byuser, date received, or alphabetically by subject), there are manyinstances where it may be advantageous to a user to employ an emailsystem that provides other benefits.

There is a need for a system that overcomes the above problems, as wellas providing additional benefits.

SUMMARY

Described herein are a system, method and computer-readable storagemedium storing instructions for controlling a computer system to performa method of transferring an email message to a secondary copy of a datastore associated with a mailbox. The method includes accessing an eventsync file associated with a mailbox, wherein the event sync the includesindications of changes made to electronic mail messages within themailbox, and creating a change journal to include entries associatedwith the changes made to the electronic mail messages within themailbox. The method further includes identifying the changes made to theelectronic mail messages within the mailbox from information within thechange journal entries, and transferring the changes to the electronicmail messages within the mailbox to a secondary copy of data associatedwith the mailbox.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram illustrating a data storage system forcreating a secondary copy of data files having individual discrete dataobjects, such as emails in a Microsoft Exchange mailbox.

FIG. 2 is a flow diagram illustrating a routine for copying MicrosoftExchange data.

FIG. 3 is a table illustrating a data structure containing log entriesof changes to a mailbox.

FIG. 4A is a flow diagram illustrating a routine for updating an indexof content.

FIG. 4B is a flow diagram illustrating a routine for updating an indexof data classification information.

FIG. 5 is a flow diagram illustrating a routine for updating a mailboxbased on a change to the mailbox.

FIG. 6 illustrates a mailbox presenting a list of messages to a userbased on content of a received message.

FIG. 7A illustrates a display screen on a mobile device.

FIG. 7B illustrates the display screen of FIG. 7A modified based on achange to a mailbox.

DETAILED DESCRIPTION Overview

A method and system for identifying, copying, and leveraging changes toa data set, such as a data set on a Microsoft Exchange or other mailserver, is disclosed. The system receives an alert or other signal froma mail server indicating a change to a data set at the mail server,stores an indication of the event in a log or other data structure,queries the log for information related to the event, and performs (or,initiates) a data storage operation based on results of the query.

For example, when an email message is deleted by a user in MicrosoftOutlook, a supporting Exchange Server updates a synchronization file(such as an Event Sync file) to indicate that an event (such as an SMTPevent), the deletion, has occurred within the user's mailbox. The systemaccesses the synchronization file, identifies the event, and storesinformation about the event (such as path information related to thelocation of the event and the type of event) into a log file, or changejournal. Later, a data storage component accesses the change journal,queries the change journal to identify changes that have occurredduring, a certain time period (such as since the last data storageoperation), and uses the results of the query to determine changes tothe mailbox, and copies or performs a backup of the changes. A change toa mailbox may be a received message, a moved message (such as from onefolder to another), a deleted message, and so on.

In some examples, the system updates an index of content based on andafter identifying changes to a mailbox via a change journal. A contentindexing system may update an index associated with a mailbox or otherdata store by accessing the change journal to identify changes to themailbox and indexing content related to those changes.

In some examples, the system updates an index of data classificationbased on and after identifying changes to a mailbox via a changejournal. A data classification system may update an index associatedwith a mailbox or other data store by accessing the change journal toidentify changes to the mailbox, and classifying data related to thosechanges.

In some examples, an email system may update, present, or modify thecontents of a mailbox based on identifying changes to the mailbox. Theemail system may extract the content or classification of data within achanged email message, associate other messages similarly classified orcontaining similar content, and modify the mailbox to present theassociated messages along with the changed message to a user. The systemmay synchronize a mailbox on a mobile device based on a change to themailbox. In some cases, the system may modify the presentation of amailbox to a user based on a change to the mailbox. The system mayperform one or more actions based on a change to the mailbox.

The system will now be described with respect to various embodiments.The following description provides specific details for a thoroughunderstanding of, and enabling description for, these embodiments of thesystem. However, one skilled in the art will understand that the systemmay be practiced without these details. In other instances, well-knownstructures and functions have not been shown or described in detail toavoid unnecessarily obscuring the description of the embodiments of thesystem.

The terminology used in the description presented below is intended tobe interpreted in its broadest reasonable manner, even though it isbeing used in conjunction with a detailed description of certainspecific embodiments of the system. Certain terms may even be emphasizedbelow; however, any terminology intended to be interpreted in anyrestricted manner will be overtly and specifically defined as such inthis Detailed Description section.

Suitable System

The system may create a secondary copy of a data set, such as a storagegroup containing one or more mailboxes, as part of an existing backupschedule performed by an organization. For example, an organization mayperform weekly backups that contain a complete copy of theorganization's email data. The system may create secondary copies usingvarious data storage operations, such as snapshots, continuous datareplication, and so on. Secondary copies may include backup copies,auxiliary copies, archive copies, and so on.

Referring to FIG. 1, a block diagram illustrating a data managementsystem 100 for creating a secondary copy of Microsoft Exchange data isshown. FIG. 1 and the following discussion provide a brief, generaldescription of a suitable computing environment in which the system canbe implemented. Although not required, aspects of the system aredescribed in the general context of computer-executable instructions,such as routines executed by a general-purpose computer, e.g., a servercomputer, wireless device or personal computer. Those skilled in therelevant art will appreciate that the system can be practiced with othercommunications, data processing, or computer system configurations,including: Internet appliances, network PCs, mini-computers, mainframecomputers, and the like. Indeed, the terms “computer,” “host,” and “hostcomputer” are generally used interchangeably herein, and refer to any ofthe above devices and systems, as well as any data processor.

Aspects of the system can be embodied in a special purpose computer ordata processor that is specifically programmed, configured, orconstructed to perform one or more of the computer-executableinstructions explained in detail herein. Aspects of the system can alsobe practiced in distributed computing environments where tasks ormodules are performed by remote processing devices, which are linkedthrough a communications network, such as a Local Area Network (LAN),Wide Area Network (WAN), Storage Area Network (SAN), Fibre Channel, orthe Internet. In a distributed computing environment, program modulesmay be located in both local and remote memory storage devices.

Aspects of the system may be stored or distributed on computer-readablemedia, including magnetically or optically readable computer discs,hard-wired or preprogrammed chips (e.g., EEPROM semiconductor chips),nanotechnology memory, biological memory, or other data storage media.Indeed, computer implemented instructions, data structures, screendisplays, and other data under aspects of the system may be distributedover the Internet or over other networks (including wireless networks),on a propagated signal on a propagation medium (e.g., an electromagneticwave(s), a sound wave, etc.) over a period of time, or they may beprovided on any analog or digital network (packet switched, circuitswitched, or other scheme). Those skilled in the relevant art willrecognize that portions of the system reside on a server computer, whilecorresponding portions reside on a client computer, and thus, whilecertain hardware platforms are described herein, aspects of the systemare equally applicable to nodes on a network.

The data management system 100 includes a data storage system 110 incommunication with a mailbox group 140 that contains one or moremailboxes 150, such as a user1 mailbox 151, a user2 mailbox 152, and auserN mailbox 153. For example, the mailbox group may be a MicrosoftExchange group that manages various user mailboxes 150. The data storagesystem 110 and the mailbox group 140 may communicate over wired orwireless connections, such as via a storage network.

The data storage system 110 may include a log component 111, such as acomponent that stores a change journal, a copy component 112 thatinitiates or facilitates the performance of data storage operations, andother components 113, such as components that communicate with a dataclassification system 120, a content indexing component 130, and/orother components under management by the system.

The copy component 112 may transfer data to other components (not shown)of the data storage system 100 that transfer data to secondary storagemedia, such as magnetic tape, optical disks, solid-state media, and soon. The data storage system may contain some or all of the followingcomponents, depending on the needs of the system. For example, the datastorage system 100 may contain a storage manager, one or more clients,one or more media agents, and one or more storage devices. The storagemanager controls the media agents, which are responsible fortransferring data to storage devices. The storage manager includes ajobs agent, a management agent, a database, and/or an interface module.The storage manager communicates with client(s). One or more clients mayaccess or receive data to be stored by the system from a database via adata agent. For example, the clients may access data from one or more ofthe mailboxes 150 upon receiving instructions from the copy component112. The system uses media agents, which contain databases, to transferand store data into storage devices. The client databases may containdata files and other information, while media agent databases maycontain indices and other data structures that assist and implement thestorage of data into secondary storage devices, for example.

The data storage system may include software and/or hardware componentsand modules used in data storage operations. The components may bestorage resources that function to copy data during storage operations.The components may perform other storage operations (or storagemanagement operations) other that operations used in data stores. Forexample, some resources may create, store, retrieve, and/or migrateprimary or secondary data copies. Additionally, some resources maycreate indices and other tables relied upon by the data storage systemand other data recovery systems. The secondary copies may includesnapshot copies and associated indices, but may also include otherbackup copies such as HSM copies, archive copies, and so on. Theresources may also perform storage management functions that maycommunicate information to higher level components, such as globalmanagement resources within a federated data storage system. Furtherdetails regarding suitable data storage systems may be found incommonly-assigned U.S. patent application Ser. No. 11/982,324, filed onOct. 31, 2007, entitled SYSTEMS AND METHODS OF HIERARCHICAL STORAGEMANAGEMENT, SUCH AS GLOBAL MANAGEMENT OF STORAGE OPERATIONS, which isincorporated by reference it its entirety.

In some examples, the system performs storage operations based onstorage policies, as mentioned above. For example, a storage policyincludes a set of preferences or other criteria to be considered duringstorage operations. The storage policy may determine or define a storagelocation and/or set of preferences about how the system transfers datato the location and what processes the system performs on the databefore, during, or after the data transfer. In some cases, a storagepolicy may define a logical location in which to transfer, store or copydata from a source to a destination data store, such as storage media.Storage policies may be stored in the storage manager, or may be storedin other resources, such as a global manager, a media agent, and so on.

The log component 111 may access and/or communicate with componentsassociated with a mail server, such as an event sync component, in orderto identify changes in a data set. The log component 111 may create,update, modify, and/or store one or more logs of content, such as changejournals. A change journal stores a journal entry whenever data ischanged within a computer system. The change journal generally containsa step-by-step, sequential, or ordered log of what data changed and howthe data changed that can be processed at a later time to recreate thecurrent state of the data.

In some examples, the log component 111 stores a journal entry uponidentifying a change within storage group 140. For example, the logcomponent may access the event sync the of an Exchange server and storejournal entries for all events identified in the event sync file.

Additionally, the log component 111 (or a separate component), maycreate an associated log or other data structure to parse the datawithin the change journal. For example, the log component 111 may createa SQL-based file to later query the SQL-based file when required, suchas when a backup of a mailbox is to be performed.

Backing Up a Mailbox Based on Changes to the Mailbox

As discussed herein, there is a standard mechanism in MicrosoftExchange, called an event sync mechanism, that sends a signal wheneverthere is a change in a mail server or storage group, such as a mailbeing sent, deleted, moved, or received. As discussed herein, aspects ofthe data storage system leverage the event sync mechanism (and othersimilar mechanisms) in order to quickly and efficiently copy and/orbackup a storage group, such as a collection of mailboxes.

Referring to FIG. 2, a flow diagram illustrating a routine 200 forcopying Microsoft Exchange data is shown. In step 210, the systemreceives a signal from a sync mechanism associated with a mail server,such as the event sync mechanism associated with an Exchange server. Thesignal indicates an event has occurred within a storage group, such aswithin the user1 mailbox 151. The event may be a message has beenreceived at the mailbox, a message has been moved within the mailbox, amessage has been deleted from the mailbox, and so on. The system mayinclude a component, such as log component 111, located between a mailserver, such as storage group 140, and a sync mechanism for the mailserver, in order to receive signals intended for the sync mechanism.Alternatively, the log component 111 may access a sync mechanism toextract events, such as events that have occurred in a certain timeperiod.

In step 220, the system writes the signal indicating the event to a logfile, such as a change journal stored within the log component 111. Forexample, the system creates a journal entry for every event indicated bythe sync mechanism. The journal entry may include path information forthe event (such as an identification of the mailbox, a date and time ofthe event, and so on) and event information (such as the type of event).Further details regarding a log file are discussed with respect to FIG.3.

Referring to FIG. 3, a table illustrating a data structure 300containing log entries of changes to a mailbox is shown. The datastructure 300 includes journal entries for changes within a mailbox fora given period of time. The entries may include path information 310 forthe change and flag information 320 that indicates the type of change.For example, the entry 330 indicates a change corresponding to the usedmailbox 151 receiving a message. Of course, the data structure mayinclude other information not shown in the Figure.

In step 230, the system queries the log file to extract information. Insome cases, the system may transfer the log file information to a localtable, such as a SQL database, and query the SQL database to extractinformation. For example, the system may query the SQL database todetermine what changes have occurred from a first time to a second,later time.

In step 240, the system receives results of the query, such as anidentification of all changes that occurred within the mailbox between afirst time and a second time. For each change, the system may indicate apath to the change, and a type of the change, as shown in data structure300.

In step 250, the system performs a data storage operation associatedwith the identified changes within the mailbox. For example, the systemmay transmit the extracted path information 310 and corresponding typeinformation 320 of a received message (an identified change) to the copycomponent 112, which may then instruct other data storage components tocreate or update a secondary copy that includes the contents of thereceived message, as discussed herein.

Indexing the Content of Changes to a Mailbox

In some examples, the system updates an index of content associated witha data store, such as a mailbox, based on identifying changes to themailbox as described herein. Once the system identifies changes to amailbox, the system may then index the content of the messages, and thenupdate an index associated with the mailbox.

In some cases, the content indexing system 130 may update a contentindex according to an indexing policy. An indexing policy is a datastructure that stores information about the parameters of an indexingoperation. For example, an organization may copy changes to a mailserver on a daily basis, and may create an indexing policy thatspecifies that the index is updated on a daily basis, even if backupoperations are not performed daily.

The content indexed by the content indexing system 130 may be some orall content associated with an email message. Some example content to beindexed includes sender information, recipient information, subjectinformation, message type (such as a sent or received message), textwithin the body of the message, attachment information (such as name orsize of the attachment, or content within the attachment), and othermetadata associated within the message.

Referring to FIG. 4A, a flow diagram illustrating a routine 400 forupdating an index of content for a mailbox is shown. In step 410, acontext indexing system 130 identifies a message to be indexed within alog file that stores changes to a mailbox. For example, the contentindexing system 130 may access a change journal 300 and identify entriescontaining path and type information for changes to a mailbox.

In step 420, the content indexing system accesses the messagesidentified by the path and type information, and indexes the content ofthe messages. For example, the content indexing system extracts dataassociated with the sender, recipient, and subject line for all receivedmessages within the change journal. Further details regarding theindexing of content may be found in U.S. patent application Ser. No.11/694,869, filed on Mar. 30, 2007, entitled “METHOD AND SYSTEM FOROFFLINE INDEXING OF CONTENT AND CLASSIFYING STORED DATA,” which isincorporated by reference in its entirety.

In step 430, the content indexing system updates the index to includethe indexed content. For example, the content indexing system createsentries to an index associated with the mailbox for all receivedmessages, modifies entries to the index for all moved messages, anddeletes entries to the index for all deleted messages. Thus, the contentindexing system 130 may leverage the change journal to update an indexassociated with a mailbox. The content index may then facilitate contentspecific presentations of mail messages to users, to be discussedherein.

Classifying the Data Within Changes to a Mailbox

In some examples, the system updates an index for a data store toinclude a classification of data associated with changes to a mailbox.The index may describe certain pertinent aspects of the mailbox thatallow a user or system process to consult the index to obtaininformation regarding the mailbox. For example, the data classificationsystem 120 may traverse messages identified by the change journal andobtain certain characteristics and other attributes of data within themailbox. Such an index may be a collection of metadata and/or otherinformation regarding the mailbox, and may be referred to herein as a“metabase.” Generally, metadata refers to data or information aboutdata, and may include, for example, data relating to storage operationsor storage management, such as data locations, storage managementcomponents associated with data, storage devices used in performingstorage operations, index data, data application type, or other data.

With this arrangement, if it is desired to obtain information regardingthe mailbox or characteristics of messages within the mailbox, a systemadministrator or system process may simply consult the metabase for suchinformation rather than iteratively access and analyze each data item inthe network. This may significantly reduce the amount of time requiredto obtain message information by substantially eliminating the need toobtain information from the source message. Such a data classificationsystem may associate previously stored messages with newly received ormodified messages in a mailbox via the data classification index. Forexample, the index may associate messages based on variousclassifications, such as message owners (individuals or groups), contentof the messages, resources used to create the messages, aginginformation, and so on.

Referring to FIG. 4B, a flow diagram illustrating a routine 450 forupdating an index of data classification information is shown. In step460, a data classification system 120 identifies a message that includesdata to be classified within a log file that stores changes to amailbox. For example, the data classification system 120 may access achange journal 300 and identify entries containing path and typeinformation for changes to a mailbox.

In step 470, the data classification system accesses the messagesidentified by the path and type information, and classifies data withinthe messages. For example, the data classification system may traversethe identified messages to obtain certain information regarding themessages such as any available metadata. Such metadata may includeinformation about messages or characteristics associated with datawithin the messages such as the data owner (e.g., the client or userthat generates the data or other data manager), the last modified time(e.g., the time of the most recent modification), the data size (e.g.,number of bytes of data), information about the data content (e.g., theapplication that generated the data, the user that generated the data,etc.), to/from information (e.g., an email sender, recipient orindividual or group on an email distribution list), creation date (e.g.,the date on which the data was created), file type (e.g., format orapplication type), last accessed time (e.g., the time the data was mostrecently accessed or viewed), application type (e.g., the applicationwhich generated the data), location/network (e.g., a current, past orfuture location of the data and network pathways to/from the data),frequency of change (e.g., a period in which the data is modified),business unit (e.g., a group or department that generates, manages or isotherwise associated with the data), and aging information (e.g., aschedule, which may include a time period, in which the data is migratedto secondary or long term storage), and so on. Further details regardingthe indexing of content may be found in U.S. patent application Ser. No.11/564,119, filed on Nov. 28, 2006, entitled SYSTEMS AND METHODS FORCLASSIFYING AND TRANSFERRING INFORMATION IN A STORAGE NETWORK, which isincorporated by reference in its entirety.

In step 480, the data classification system updates the index to includethe classification information, such as the information described above.Thus, the data classification system 120 may leverage the change journalto update an index associated with a mailbox. The data classificationindex may then facilitate content specific presentations of mailmessages to users, to be discussed herein.

Presenting a Mailbox Based on Changes to the Mailbox

In some examples, the system facilitates the presentation of messageswithin a mailbox based on the content of a change to the mailbox. Forexample, the system may present messages along with a received messagethat contain content similar to the content within the received message.

Referring to FIG. 5, a flow diagram illustrating a routine 500 forupdating a mailbox based on a change to the mailbox. In step 510, thesystem receives an indication of a change to a mailbox via a syncmechanism, such as event sync. The system may identify a messagecorresponding to the change, such as a newly received message within themailbox.

In step 520, the system identifies content within the change to themailbox. The system may look to a content index or classification index(or may first index or classify the message) to determine contentassociated with the message. For example, the system may identifyinformation within the subject line of a message.

In step 530, the system associates other messages within the mailboxwith the change to the mailbox. For example, the system may look to anindex of content, identify the other messages within the mailbox thatcontain a subject line similar to the subject line for a newly receivedmessage, and associate the other messages to the newly received message.Further details regarding the association of messages will be describedbelow.

In step 540, the system updates the presentation of the mailbox based onthe associated messages. For example, the system may present theassociated messages along with the newly received message, may provide alink or other indication that identifies the associated messages, maysort an inbox or other folder within the mailbox to order messages basedon the content of the new message or the associated messages, and so on.

For example, FIG. 6 illustrates a mailbox 600 presenting a list ofmessages to a user based on the content of a received message. Themailbox 600 presents messages to a user via certain messageidentification information, such as information associated with a nameof a sender 610, information associated with a subject of the message620, information associated with a date/time received 630, and so on.The mailbox may include a number of folders 640, including an inbox 645presented in the Figure. The inbox contains a number of messages 650,such as a newly received message 652 from “Mom” entitled “flight toEurope,” and a number of messages associated based on content withmessage 652. For example, message 654 includes the subject “Europeflight,” and message 656 includes the subject “re: your flight.” Theinbox may also include message 657 from “dad” regarding “Europe flight”.The inbox may include unassociated messages, such as message 658.

In this example, the system presents the newly received message 652along with messages determined to be associated with the newly receivedmessage, based on the content of the newly received message. The systemleverages the event sync system and change journal discussed herein todetermine that the mailbox has received a new message and to index thecontent of the new message without requiring access to all the messageswithin the mailbox. The system can then associate messages with thenewly received message via the index of content and present thesemessages along with the newly received message.

Thus, unlike typical systems that can only sort messages based on alimited number of fields, the system described herein is capable ofproviding a user with numerous advantages when a change to a mailboxoccurs. In the above example, the system provides a user with previousmessages that may relate to a newly received message, providingcontextual and historical information for the newly received message.Even though the associated messages have different subject lines, theyhave similar content within the subject lines (or within other fields ofthe messages), and the system, via a content index, can associate themessages based on the similar content. This enables the system todisplay messages to a user that may be similar in content but areotherwise unassociated (that is, they are not part of an email string,they do not contain the exact same subject line, and so on) when a newmessage is received at the mailbox.

In some examples, the system may display an indication that there aremessages associated with a newly received message. For example, aftercreating a change journal for changes to a mailbox, the system maydisplay an indicator 670 proximate to a newly received message 652,indicating to a user that other messages stored within the mailboxinclude similar content. This can be helpful when a user receives amessage related to a subject long after any previous correspondence wasreceived for that subject. The user may wish to quickly determine theimportance of the message, but may not wish to view all the associatedmessages. The indicator 670 may link to or expand the view to includethe associated messages.

In some cases, a user may access a mailbox via a mobile device thatpresents messages pushed to the mobile device via an enterprise server.Due to the limited display space on mobile devices, the user may wish toonly receive and/or view certain messages on his/her mobile device, ormay wish to retrieve certain messages that are stored within the user'smailbox, but have been deleted from the user's mobile device.

FIG. 7A illustrates a display screen 700 on a mobile device thatpresents one or more messages 710 to a user. An email program presents anewly received message, such as message 710 from “Jack Jones” entitled“Re: Your Car.” The user, away from his work computer, may wish to viewon his mobile device other messages related to his car that he receivedin a certain time period. The system may present a selection toolbar 720that presents options such as to sync 721 the device to include othermessages associated with the received message, may ask for a certaintime period of received messages 721724, and so on. Upon receiving aselection from the user, the system may retrieve any associated messagesthat satisfy the user's request, and present them to the user, shown asdisplay 730 in FIG. 7B.

FIG. 7B illustrates the display screen of FIG. 7A modified based on achange to a mailbox. In the example above, the user received a newmessage about his car, and caused the system to sync the device toreceive any associated messages. The system transfers a number ofdifferent messages and displays them as view 730. They are associatedwith the new message via content index or classification index describedherein. Thus, the system facilitates the user to retrieve messagesquickly and efficiently to his mobile device when a new message isreceived at the mobile device.

Other example processes facilitated by the system may include thefollowing:

Upon deletion of an email message from a user's mobile device, thesystem identifies the change to the mailbox and identifies other emailmessages associated with the deleted email message, and deletes theassociated messages. In some cases, the system may request authorizationfrom the user before deleting the messages.

When a user moves a message to a different folder (such as a folder fora specific project), the system identifies the change to the mailbox andidentifies other email messages associated with the moved emailmessages, and moves the other email messages to the folder.

Upon receiving a new message, the system may display all messagesassociated with the new message, and may create a new folder for thegroup of messages when requested by a user. These messages may be fromdifferent folders (inbox, sent messages, deleted messages, otherfolders). The system may facilitate building a historical context for areceived message, and then storing the messages (or additional copies ofthe messages or portions of the messages) within a specific folder. Thismay enable a user to build a quick history of a certain subject when anew email message is received in order to provide the user withinformation that may assist the user in responding to the message, amongother benefits.

In addition, the system may generate reports based on informationextracted from the change journal. For example, the system may generatereports for a given time period of changes, the content within thechanges and so on. The reports may identify changes associated withmessages having similar data classifications, changes associated withmessages having similar content, and so on. Components within the systemmay leverage information with the reports to update or modify storageoperations, periodically or dynamically.

CONCLUSION

From the foregoing, it will be appreciated that specific embodiments ofthe system have been described herein for purposes of illustration, butthat various modifications may be made without deviating from the spiritand scope of the system. Accordingly, the system is not limited exceptas by the appended claims.

Unless the context clearly requires otherwise, throughout thedescription and the claims, the words “comprise,” “comprising,” and thelike are to be construed in an inclusive sense, as opposed to anexclusive or exhaustive sense; that is to say, in the sense of“including, but not limited to.” The word “coupled”, as generally usedherein, refers to two or more elements that may be either directlyconnected, or connected by way of one or more intermediate elements.Additionally, the words “herein,” “above,” “below,” and words of similarimport, when used in this application, shall refer to this applicationas a whole and not to any particular portions of this application. Wherethe context permits, words hi the above Detailed Description using thesingular or plural number may also include the plural or singular numberrespectively. The word “or” in reference to a list of two or more items,that word covers all of the following interpretations of the word: anyof the items in the list, all of the items in the list, and anycombination of the items in the list.

The above detailed description of embodiments of the system is notintended to be exhaustive or to limit the system to the precise formdisclosed above. While specific embodiments of, and examples for, thesystem are described above for illustrative purposes, various equivalentmodifications are possible within the scope of the system, as thoseskilled hi the relevant art will recognize. For example, while processesor blocks are presented hi a given order, alternative embodiments mayperform routines having steps, or employ systems having blocks, in adifferent order, and some processes or blocks may be deleted, moved,added, subdivided, combined, and/or modified. Each of these processes orblocks may be implemented in a variety of different ways. Also, whileprocesses or blocks are at times shown as being performed in series,these processes or blocks may instead be performed in parallel, or maybe performed at different times.

The teachings of the system provided herein can be applied to othersystems, not necessarily the system described above. The elements andacts of the various embodiments described above can be combined toprovide further embodiments.

These and other changes can be made to the system in light of the aboveDetailed Description. While the above description details certainembodiments of the system and describes the best mode contemplated, nomatter how detailed the above appears in text, the system can bepracticed in many ways. Details of the system may vary considerably inimplementation details, while still being encompassed by the systemdisclosed herein. As noted above, particular terminology used whendescribing certain features or aspects of the system should not be takento imply that the terminology is being redefined herein to be restrictedto any specific characteristics, features, or aspects of the system withwhich that terminology is associated. In general, the terms used in thefollowing claims should not be construed to limit the system to thespecific embodiments disclosed in the specification, unless the aboveDetailed Description section explicitly defines such terms. Accordingly,the actual scope of the system encompasses not only the disclosedembodiments, but also all equivalent ways of practicing or implementingthe system under the claims.

While certain aspects of the system are presented below in certain claimforms, the inventors contemplate the various aspects of the system inany number of claim forms. For example, while only one aspect of thesystem is recited as embodied in a computer-readable medium, otheraspects may likewise be embodied in a computer-readable medium.Accordingly, the inventors reserve the right to add additional claimsafter filing the application to pursue such additional claim forms forother aspects of the system.

We claim:
 1. A non-transitory computer-readable medium storinginstructions which, when executed by a computing system cause thecomputing system to perform a method for displaying email messages to auser, the method comprising: accessing a message received at a mailbox,wherein the mailbox is hosted by a mail server; extracting contentassociated with the received message; identifying other messagescontained in the mailbox having content similar to the extractedcontent; and transmitting the identified other messages from the mailboxto a mail application of a mobile device associated with the mailbox,wherein the mail application locally stores on the mobile device fewerthan all messages stored in the mailbox on the mail server, and whereinthe mobile device is remotely located from the mail server hosting themailbox.
 2. The computer-readable medium of claim 1, further comprising:presenting the accessed message along with the identified other messagesvia a display of the mobile device.
 3. The computer-readable medium ofclaim 1, further comprising: generating a mailbox folder that containsthe accessed message and the identified other messages.
 4. Thecomputer-readable medium of claim 1, wherein identifying other messagescontained in the mailbox having content similar to the extracted contentincludes accessing an index of content, wherein the index includescontent contained by messages received at the mailbox and associatedwith the user.
 5. The computer-readable medium of claim 1, furthercomprising: presenting at least one identified other message directlyabove or below the accessed message in a presented list of messages,wherein the at least one identified other message resides in a folder ofthe mailbox different from a folder of the mailbox that includes thereceived message.
 6. The computer-readable medium of claim 1, whereinidentifying other messages contained in the mailbox having contentsimilar to the extracted content includes identifying at least onemessage having a subject line different than a subject line of thereceived message.
 7. The computer-readable medium of claim 1, whereinidentifying other messages contained in the mailbox having contentsimilar to the extracted content includes identifying at least onemessage that is not part of an email thread that includes the receivedmessage.
 8. The computer-readable medium of claim 1, wherein identifyingother messages contained in the mailbox having content similar to theextracted content includes identifying at least one message havingcontent within a body of the message that matches the extracted content.9. A method for displaying email messages to a user, the methodcomprising: accessing a message received at a mailbox, wherein themailbox is hosted by a mail server; extracting content associated with areceived message; identifying other messages contained in the mailboxhaving content similar to the extracted content, wherein at least one ofthe identified other messages is not part of a thread of email messagesthat includes the received message; transmitting the identified othermessages from the mailbox to a mail application of a computing deviceassociated with the mailbox, wherein the mail application locally storeson the computing device fewer than all messages stored in the mailbox,and wherein the computing device is remotely located from a mail serverhosting the mailbox; and presenting the received message along withinformation associated with the identified other messages to a user. 10.The method of claim 9, wherein presenting the received message includespresenting at least one identified other message directly above or belowthe received message in a presented list of messages, wherein the atleast one identified other message resides in a folder of the mailboxdifferent from a folder of the mailbox that receives the receivedmessage.
 11. The method of claim 9, wherein presenting the receivedmessage includes presenting an indication that other messages containcontent similar to the content of the received message.
 12. The methodof claim 9, wherein the other messages are stored in secondary storageand the received message is stored in primary storage.
 13. The method ofclaim 9, wherein identifying other messages contained in the mailboxhaving content similar to the extracted content includes identifying atleast one message having content within a body of the message thatmatches the extracted content.
 14. A system for performing a datastorage operation for data stored on a mail server, the systemcomprising: an identification component that identifies one or morechanges within a data store associated with a mailbox; a data transfercomponent configured to: access messages stored within the data storethat are associated with the one or more identified changes, and copythe accessed messages to a secondary copy of the data store; and amailbox component configured to: extract content from a message,identify other messages in the secondary copy of the data store aresimilar to the extracted content, transmit the other messages to a mailapplication on a computing device associated with the mailbox, whereinthe computing device is remotely located from the mailbox, and presentthe other messages using the mail application.
 15. The system of claim14, further comprising: a log component that generates one or more logentries, wherein each of the one or more log entries is associated withthe one or more changes to the data store; and a content indexingcomponent that: accesses the one or more log entries; and creates anindex of content associated with the messages stored within the datastore and associated with the one or more identified changes.
 16. Thesystem of claim 14, wherein the one or more changes within the datastore include movement of at least one electronic mail message from onefolder to another folder within the mailbox.
 17. The system of claim 14,wherein the one or more changes within the data store include deletionof at least one electronic mail message from a folder within themailbox.
 18. The system of claim 14, wherein the one or more changeswithin the data store include reception of at least one electronic mailmessage at an inbox folder within the mailbox.
 19. The system of claim15, wherein the content associated with the messages includes senderinformation, recipient information, subject information, message type,text within the body of the messages, or attachment information.